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Synthetic rhgl gene 



AATGGGAGGAGTGGGAAAGACAGTGGCTATGGAGCTTGTTCCGGAGGTTGGGTTGGAAT 
CAAGTGTGCTCAGGGACAGGTTATTGTGATCCAGCTTCCTTGGAAGGGTTTGAGGGGTC 
GAATCACCGACAAAATTGGCCAACTTCAAGGCCTCAGGAAGCTTAGTCTTCATGATAAC 
CAAATTGGTGGTTCAATCCCTTCAACTTTGGGACTTCTTCCCAACCTTAGAGGGGTTCA 
GTTATTCAACAATAGGCTTACAGGTTCCATACCTCTTTCTTTAGGTTTCTGCCTTTGCT 
TCAAGTCTCTTGACCTCAGCAACAACTTGCTCACAGGAGCAATCCCTTATAGTCTTGCT 
AATTCCACTAAGCTTTATTGGCTTAACTTGAGTTTCAACTCCTTCTCTGGTCCTTTACC 
AGCTAGCCTAACTCACTCATTTTCTCTCACTTTTCTTTCTCTTCAAAATAACAATCTTT 
CTGGCTCCCTTCCTAACTCTTGGGGTGGGAATTCCAAGAATGGCTTCTTTAGGCTTCAA 
AATTTGATCCTAGATCATAACTTTTTCACTGGTGACGTTCCTGCTTCTTTGGGTAGCTT 
AAGAGAGCTCAATGAGATTTCCCTTAGTCATAATAAGTTTAGTGGAGCTATACCAAATG 
AAATAGGAACCCTTTCTAGGCTTAAGACACTTGACATTTCTAATAATGCCTTGAATGGG 
AACTTGCCTGCTACCCTCTCTAATTTATCCTCACTTACACTGCTGAATGCAGAGAACAA 
CCTCCTTGACAATCAAATCCCTCAAAGTTTAGGTAGATTGCGTAATCTTTCTGTTCTGA 
TTTTGAGTAGAAACCAATTTAGTGGACATATTCCTTCAAGCATTGCAAACATTTCCTCG 
CTTAGGCAGCTTGATTTGTC ACTGAATA ATTTCAGTGGAGAAATTCCAGTCTCCTTTGA 
CAGTCAGCGCAGTCTAAATCTCTTCAATGTTTCCTACAATAGCCTCTCAGGTTCTGTCC 
CCCCTCTGCTTGCCAAGAAATTTAACTCAAGCTCATTTGTGGGAAATATTCAACTATGT 
GGGTACAGCCCTTCAACCCCATGTCTTTCCCAAGCTCCATCACAAGGAGTCATTGCCCC 
ACCTCCTGAAGTGTCAAAACATCACCATCATAGGAAGCTAAGCACCAAAGACATAATTC 
TCATAGTAGCAGGAGTTCTCCTCGTAGTCCTGATTATACTTTGTTGTGTCCTGCTTTTC 
TGCCTGATCAGAAAGAGATCAACATCTAGGCCGGGAACGGCCAAGCCACCCGAGGGTAG 
AGCGGCCACTATGAGGACAGAAAAAGGAGTCCCTCCAGTTGCTGGTGGTGATGTTGAAG 
CAGGTGGGGAGGCTGGAGGGAAACTAGTCCATTTTGATGGACCAATGGCTTTTACAGCT 
GATGATCTCTTGTGTGCAACAGCTGAGATCATGGGAAAGAGCACCTATGGAACTGTTTA 
TAAGGCTATTTTGGAGGATGGAAGTCAAGTTGCAGTAAAGAG ATTGAGGGAAAAGAT CA 
C TAAAGGTCATAGAGAATTTGAAT CAGAAGTCAGTGT T C T AGGAAAAATTAGAC AC C C C 
AATGTTTTGGCTCTGAGGGCCTATTACTTGGGACCCAAAGGGGAAAAGCTTCTGGGTTT 
TGATACATGTCTAAAGGAAGTCTTGCTTCTTTCCTACATGGAAGGTTCGTGTGCTGGTT 
CTTTCATTAAAGTGTTGTGTGTGCTGGTCTTTAATTATAATTTGGAGTTTTACCTTAGT 
AATCTGTATAATT C TAAT C GGAGAAC AGTACAAACAAAAAC AC C TAAGGAAC AAC AC CT 
TANCTTTAATATACCATATCAATAAAGTGAAATATTTTCTTGGTCATCTTGATGCAGGG 
GGAACTGAACATTCATTATTGGCCACA AGATTAAAATA GCCCAAGCCTTGGCCCGGGCT 
TGTTTGCCTTCATTCCCAGGAGAACATCATACATGGGACCTCNCATCCAGCAATGTGTG 
GCTTGATGAAAAACAAATGCTAAAATTCAGATTTTGGTCTTTTTCGGGTTGATGTCAAC 
TGCTGCTAATTCCAACGTGATAGCTACAGCTGGAGCATTGGATACCGGGCACCTGAGCT 
CTCAAAGCTCAAGAAAGCAAACACTAAAACTGATATCTACAGTCTTGGTGTTATCTTGT 
TAGAACTCCTAACGAGGAAATCACCTGGGGTGTCTATGAATGGACTAGATTTGCCTCAG 
TGGGTTGCCTCAGTTGTCAAAGAGGAGTGGACAAATGAGGTTTTTGATGCAGACTTGAT 
GAGAGATGCATCCACAGTTGGCGACGAGTTGCTAAACACGTTGAAGCTCGCTTTGCACT 
GTGTTGATCCTTCTCCATCAGCACGACCAGAAGTTCATCAAGTTCTCCAGCAGCTGAAG 
AGATTAGACCAGAGAGATCAGTCACAGCCAGTCCCGGGGACGATATCGTATAGCACAAA 



Figure 7B 



TCTTTCATTCTTATATTATTTTTGCCTGTTTGAATGCTTGAATTTGTACATACTCATAC 
TACAATAAGGTGTAGTTCTGGTTAATTTTACCTCTACCTCAAAGCTGGGGTGTAATTCT 
GTTTCCTCCAAGGCACATAATAGTTGAAAATAGTTCTCAGGAGCATTCATTGTTTATTC 
TGCAAGATTCTCTTTCACGGCTGCTATCTTCTATGCATGCCCTGCCCATAAATGCATTA 
TGAAGAATTGTAACGGCTGTGTTTTTGGACTTCTTCAAAAAGTTTATGTTATTGCCAGG 
TGTATATATCAACATGTTTTAAAGATTTTCAAACAATCAGGTTTTAGATGTGGGTTTGC 
ATGCATGAGATTGGACTAGTGCGCTTGATGTAGTATAAAATATAAATTGTCCAATCAAG 
CACCCTCTACATGTCCAAATAATGGGCCTTATGAAACTTAATTTTTTAATTACAAACTA 
CAGTAATCTTTTTGAATAAAGATTTACAAATTACAACNGACATGTGAAGCNGCATCTTT 
NATTGNCAATCTTTCAAGTTACTCTATTATTTTCTGCN 
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Rhgl Peptide 

NGRSGKDSGYGACSGGWVGIKCAQGQVIVIQLPWKGLRGRIT 

DKIGQLQGLRKLSLHDNQIGGSIPSTLGLLPNLRGVQLFNNRLG 

SIP 

LSLGFCPLLQSLDLSNNLLTGAIP 

YSLANSTKLYWLNLSFNSFSGPLP 

ASLTHSFSLTFLSLQNNNLSGSLPNSWGG 

NS KNGF FRLQNL I LDHNF FTGDVP 

ASLGSLRELNEISLSHNKFSGAIP 

NE I GTLSRLKTLD I SNNALNGNLP 

ATL SNLS SLTLLNAENNLLDNQ I P 

QSLGRLRNLSVLILSRNQFSGHIP 

SSIANISSLRQLDLSLNNFSGEIP 

VSFDSQRSLNLSNVSYNSLSGSVP 

PLLAKKFNS S S FVGNIQLCGYSP 
STPCLSQ 

APSQGVIAPPPEVSKHHHHR 

KLSTKDIILIVAGVLLWLIILCCVLLFCLIRKRS 

TSKAGNGQATEGRAATMRTEKGVPPVAGGDVEAGGEAGGKLVHF 
DGPMAFTADDLLCAT AE I MGKSTYGTVYKAI LEDGSQVAVKRLR 
EKITKGHREFESEVSVLGKIRHPNGLALRAYYLGPKGEKLLVFD 
YMS KGGLLLF YMEGS CAGS F I KVLCVLVFNYNLEF YLSNLYNSN 
RRT VQT KT P KE QHLXFN I P YQ 

-SEIFSWSS-CRGN-TFIIGHKMKIXQDLAVACSPSFPETSYMD 
LXSSNVCX-NXMLKLQFWSFSVDVNCC-FQRDSYSWSIGIPGT- 
ALKAQESKH-N- YLQSWCYLVRTPNEEITWGVYEWTRFASVGCL 
S CQRGVDK- GF - CRLDERC I HS WRRVAKHVEARFALC - SFSIS 
TTRSSSSSPAAGRD-TREISHSQSHLPGRPLEPYSESY 
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l - Arabidopsis thaliana 

- Arabidopsis thaliana 

- Arabidopsis thaliana 
• Arabidopsis thaliana 
i - Arabidopsis thaliana 

- Arabidopsis thaliana 
Ipomoea nil (Japanese... 
Arabidopsis thaliana 



s producing significant alignments 

'0 hypothetical protein T18N14.120 
!7 hypothetical protein F18O21.60 - 

17 hypothetical protein F23E13.70 ■ 

18 hypothetical protein T5P19.20 - 
.0 hypothetical protein T20L15.160 
\0_ protein kinase homolog M3E9.3I 
i_6 receptor-like protein kinase 

19 receptor-like protein kinase 

J5 disease resistance protein Cf-2.2 - currant tomato 
i4 disease resistance protein Cf-2.1 - currant tomato 
■ 3_ disease resistance protein Hcr2-5D - tomato 
£ receptor-like protein kinase 5 (EC 2.7.1.-) precurso . . 
^9 receptor-like protein kinase-like protein - Arabidop . . 
3^ receptor protein kinase-like protein - Arabidopsis t.. 
_5 hypothetical protein F1C12 . 190 - Arabidopsis thaliana 
6 hypothetical protein T13K14.100 - Arabidopsis thaliana 
^8 hypothetical protein FSH11 . 170 - Arabidopsis thaliana 
_7 receptor-kinase like protein - Arabidopsis thaliana 
"2_ hypothetical protein F18F4.240 - Arabidopsis thaliana 
J9 probable serine/threonine-specif ic protein kinase (E.. 
4^ probable receptor protein kinase - rice 

1 receptor protein kinase homolog [imported] - soybean 

1_ receptor protein kinase-like protein - Arabidopsis t.. 
8 receptor-kinase like protein - Arabidopsis thaliana 
0_ receptor protein kinase homolog [imported] - soybean 
_5 receptor kinase-like protein - Arabidopsis thaliana 
6 brassinosteroid-insensitive protein BRI1 - Arabidops . . 

2 protein kinase homolog F22013.7 - Arabidopsis thaliana 

6 protein kinase Xa21 (EC 2.7.1.-), receptor type prec . . 
"£ kinase-like transmembrane protein TMKL1 precursor - . . 

4 protein kinase homolog T1F15.2 - Arabidopsis thaliana 

5 protein kinase Xa21 (EC 2.7.1.-) Al, receptor type -.. 

7 protein kinase homolog F6H11.160 - Arabidopsis thalian 

3 protein kinase Xa21 (EC 2.7.1.-), receptor type - rice 
7 protein kinase Xa21 (EC 2.7.1.-) D, receptor type - .. 
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>pir : T46070 hypothetical protein T18N14.12 0 - Arabidopsis thaliana 
Length = 83 S 

Score = 632 bits (1613) , Expect = e-180 

Identities = 329/550 (59%) , Positives = 400/550 (71%) , Gaps = 2/550 (0%) 
Frame = +1 

Query: 7 RSGKDSGYGACSGGWVGIKCAQGQVIVIQLPWKGLRGRITDKIGQLQGLRKLSLHDNQIG 186 

+S +S GW GIKC +GQV+ IQLPWKGL G I++KIGQL LRKLSLH+N I 

Sbjct: 72 KSWNNSASSQVCSGWAGIKCLRGQWAIQLPWKGLGGTISEKIGQLGSLRKLSLHNNVIA 131 

Query: 187 GSIPSTLGLLPNLRGVQLFNNRLTGSIPLSLGFCLCFKSLDLSNNLLTGAIPYSLANSTK 3 66 

GS+P +LG L +LRGV LFNNRL+GSIP+SLG C ++LDLS+N LTGAIP SL ST+ 
Sbjct: 132 GSVPRSLGYLKSLRGVYLFNNRLSGSIPVSLGNCPLLQNLDLSSNQLTGAIPPSIiTESTR 191 

Query: 3 67 LYWLNLSFNSFSGPLPASLTHSFSLTFLSLQNNNLSGSLPNSWGGNSKNGFFRLQNLILD 54 6 

LY LNLSFNS SGPLP S+ S++LTFL LQ+NNLSGS+P+ + NG L+ L LD 

Sbjct: 192 LYRLNLSFNSLSGPLPVSVARSYTLTFLDLQHNNLSGSIPDFF VNGSHPLKTLNLD 247 

Query: 547 HNFFTGDVPASLGSLRELNEISLSHNKFSGAIPNEIGTLSRLKTLDISNNALNGNLPATL 726 

HN F+G VP SL L E+S+SHN+ SG+IP E G L L++LD S N++NG +P + 

Sbjct: 248 HNRFSGAVPVSLCKHSLLEEVSISHNQLSGSIPRECGGLPHLQSLDFSYNSINGTIPDSF 307 

Query: 727 SNLSSLTLLNAENNLLDNQIPQSLGRLRNLSVLILSRNQFSGHIPSSIANISSLRQLDLS 906 

SNLSSL LN E+N L IP ++ RL NL+ L L RST+ +G IP +1 NIS +++LDLS 
Sbjct: 308 SNLSSLVSLNLESNHLKGPIPDAIDRLHNLTELNLKRNKINGPIPETIGNISGIKKLDLS 367 

Query: 907 LNNFSGEIPVSFDSQRSLNLFNVSYNSLSGSVPPLLAKKFNSSSFVGNIQLCGYSPSTPC 1086 

NNF+G IP+S L+ FNVSYN+LSG VPP+L+KKFNSSSF+GNIQLCGYS S PC 

Sbjct: 368 ENNFTGPIPLSLVHLAKLSSFNVSYNTLSGPVPPVLSKKFNSSSFLGNIQLCGYSSSNPC 427 

Query: 1087 LSQAPSQGVIAPP- -PEVSKHHHHRKLSTKDIILIVAGVLLWLIILCCVLLFCLIRKRS 1260 

+ + P + + HHHRKLS KD+ILI G LL +L++LCC+LL CLI+KR+ 

Sbjct: 42 8 PAPDHHHPLTLSPTSSQEPRKHHHRKLSVKDVILIAIGALLAILLLLCCILLCCLIKKRA 4 87 

Query: 1261 TSRPGTAKPPEGRAATMRTEKGVPPVAGGDVEAGGEAGGKLVHFDGPMAFTADDLLCATA 1440 

K +G+ T +EK V G AGGE GGKLVHFDGP FTADDLL CAT A 

Sbjct: 488 ALKQKDGKDKT- -SEKTVSAGVAGTASAGGEMGGKLVHFDGPFVFTADDLLCATA 540 

Query: 1441 EIMGKSTYGTVYKAILEDGSQVAVKRLREKITKGHREFESEVSVLGKIRHPNVLALRAYY 1620 

EIMGKSTYGT YKA LEDG++VAVKRLREK TKG +EFE EV+ LGKIRH N+LALRAYY 
Sbjct: 541 EIMGKSTYGTAYKATLEDGNEVAVKRLREKTTKGVKEFEGEVTALGKIRHQNLLALRAYY 600 

Query: 1621 LGPKGEKLLGFD 165 6 

LGPKGEKLL FD 
Sbjct: 601 LGPKGEKLLVFD 612 



Figure 7F 



Score = 185 bits (464) , Expect = le-45 

Identities = 93/161 (57%) , Positives = 122/161 (75%) , Gaps = 3/161 (1%) 
Frame = +2 

Query: 1943 GLVCLHS QENI IHGTSHP AMCGLMKNKC * NS DFGLFRVDWCC*FQRDSYSWSIGYR 2113 

GL LHS EN+IH + ++ ++ N+ D+GL R+ + + ++GYR 

Sbjct: 64 7 GLAHLHSNENMIH--ENLTASNILLDEQTNAHIADYGLSRLMTAAAATNVIATAGTLGYR 7 04 

Query: 2114 APELSKLKKANTKTDIYSLGVILLELLTRKSPGVSMNGLDLPQWVASWKEEWTNEVFDA 2293 

APE SK+K A+ KTD+YSLG+I+LELLT KSPG NG+DLPQWVAS+VKEEWTNEVFD 
Sbjct: 705 APEFSKIKNASAKTDVYSLGIIILELLTGKSPGEPTNGMDLPQWVASIVKEEWTNEVFDL 764 

Query: 2294 DLMRDAS TVGDE LLNTLKLALHCVDPS PS ARPE VHQVL QQL KRL 2425 

+LMR+ +VGDELLNTLKLALHCVDPSP+ARPE +QV++QL+ + 
Sbjct: 765 ELMRETQSVGDELLNTLKLALHCVDPSPAARPEANQWEQLEEI 80 8 



